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Abstract — We  address  the  design  of  cognitive  MAC  protocols  for 
opportunistic  spectrum  access  in  the  presence  of  sensing  error.  De¬ 
centralized  protocols  are  presented  for  ad  hoc  networks  where  there 
is  no  central  coordinator  or  dedicated  communication/control  channel. 
Secondary  users  search  for  and  access  spectrum  opportunities  indepen¬ 
dently  without  exchanging  local  information  on  spectrum  availability. 
We  assume  on-demand  partial  spectrum  monitoring  subject  to  sensing 
errors.  Specifically,  a  secondary  user  may  not  be  able  to  simultaneously 
sense  the  whole  spectrum  of  interest  due  to  hardware  limitation  and 
may  only  monitor  the  spectrum  when  it  has  data  to  transmit  in  order  to 
conserve  energy.  We  propose  two  approaches  to  formulating  opportunistic 
spectrum  access  in  the  presence  of  sensing  error  by  considering  either  a 
constraint  on  the  probability  of  colliding  with  primary  users  or  a  penalty 
imposed  on  each  collision.  Under  these  formulations,  we  address  the  joint 
design  of  both  the  spectrum  sensor  at  the  physical  layer  and  the  cognitive 
protocol  at  the  MAC  layer  to  maximize  spectrum  efficiency  while  limiting 
the  interference  to  primary  users. 

I.  Introduction 

Opportunistic  spectrum  access  (OSA),  envisioned  by  the  DARPA 
XG  program  [1],  aims  to  improve  spectrum  efficiency  by  opening 
licensed  spectrum  to  secondary  users  to  exploit  instantaneous  spec¬ 
trum  availability.  This  would  allow  secondary  users  to  identify  and 
access  available  spectrum  resources  provided  that  the  interference  to 
the  primary  users  is  kept  below  a  prescribed  level. 

In  this  paper,  we  focus  on  MAC  design  for  ad  hoc  OSA  networks. 
Recognizing  that  there  is  no  central  coordinator  or  dedicated  com¬ 
munication/control  channel  in  a  secondary  ad  hoc  network,  we  aim 
to  develop  decentralized  MAC  protocols  where  each  secondary  user 
independently  searches  for  spectrum  opportunities  without  relying  on 
cooperation  among  secondary  users.  Furthermore,  we  do  not  assume 
that  each  secondary  user  has  full  knowledge  of  the  availability  of  all 
channels;  such  knowledge  implies  continuous  full-spectrum  sensing 
synchronous  among  secondary  users.  While  simplifying  the  design  of 
OSA  networks,  continuous  full-spectrum  sensing  is  energy  inefficient 
and  hardware  demanding,  especially  for  low-cost  battery-powered 
wireless  nodes  with  bursty  traffic.  We  assume  instead  that  each 
secondary  user  can  choose  to  sense  a  subset  of  the  possible  channels 
(only  when  it  has  data  to  transmit)  and  must  decide  if  transmission 
is  possible  based  on  the  sensing  outcome. 

Under  the  assumption  of  partial  spectrum  monitoring,  we  present 
in  this  paper  a  cross-layer  approach  to  OSA  that  integrates  spectrum 
sensing  with  spectrum  access.  We  address  two  fundamental  issues 
in  decentralized  cognitive  MAC  under  partial  spectrum  monitoring. 
The  first  issue  deals  with  opportunity  identification  and  opportunity 
exploitation.  Our  focus  here  is  how  spectrum  sensing  error  affects 

°This  work  was  supported  by  the  Army  Research  Laboratory  CTA  on 
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result  was  submitted  to  IEEE  Journal  on  Selected  Areas  in  Communications. 


the  overall  network  performance  and  how  to  combat  sensing  error  at 
both  the  physical  and  MAC  layers  through  a  joint  design  of  spectrum 
sensors  and  OSA  protocols.  By  casting  this  problem  in  the  framework 
of  Partially  Observable  Markov  Decision  Process  (POMDP),  we 
obtain  spectrum  sensing  and  access  strategies  as  well  as  the  design  of 
spectrum  sensor.  We  point  out  that  although  the  issue  of  sensing  error 
has  been  investigated  recently  (see  [2]),  it  has  not  been  integrated  into 
the  design  of  cognitive  MAC  protocols. 

The  second  issue  is  transmitter-receiver  synchronization,  which  is 
unique  to  MAC  design  in  OSA  networks.  When  a  secondary  user 
hops  in  the  spectrum,  seeking  opportunities  that  are  time-varying  and 
location-dependent,  its  intended  receiver  needs  to  hop  synchronously 
in  order  to  carry  out  the  communication.  In  an  ad  hoc  OSA  network 
with  collisions  and  spectrum  sensing  errors,  maintaining  transceiver 
synchronization  without  introducing  extra  control  message  exchange 
is  nontrivial.  We  show  that  the  proposed  decentralized  MAC  protocols 
synchronize  opportunistic  users  in  the  presence  of  collisions  and 
sensing  errors. 

Related  Work  OSA  has  received  increasing  attention  [3].  The  pre¬ 
vailing  approach  to  OSA  tackles  network  design  in  two  separate  steps: 

(1)  opportunity  identification  assuming  continuous  full-spectrum  sens¬ 
ing;  (ii)  opportunity  allocation  among  secondary  users  assuming  full 
knowledge  of  spectrum  opportunities.  Opportunity  identification  in 
the  presence  of  fading  and  noise  uncertainty  has  been  studied  in 

[2] ,  [4] — [8] .  Decentralized  opportunity  allocation  strategies  can  be 
found  in  [9] — [14]  and  references  therein.  In  [15] — [18],  Centralized 
spectrum  sharing  protocols  with  a  central  coordinator  or  a  dedicated 
control  channel  are  proposed.  Spectrum  sharing  and  power  control 
for  a  group  of  interfering  users  are  studied  in  [19] — [22] .  Analytical 
results  on  OSA  networks  can  be  found  in  [23]  and  [24],  In  [23],  the 
capacity  of  multi-hop  OSA  networks  is  analyzed  using  a  Markovian 
model  on  the  spectrum  occupancy  of  the  primary  users.  In  [24],  the 
authors  identify  the  impact  of  channel  heterogeneity  resulted  from 
OSA  on  network  performance  and  higher-layer  protocol  design.  This 
paper  is  based  on  results  obtained  in  [25],  [26].  The  focus  of  this 
paper  is  different  formulations  of  and  design  strategies  for  OSA  in 
the  presence  of  spectrum  sensing  errors. 

II.  The  Network  and  Protocol  Model 

The  Network  Model  Consider  a  spectrum  consisting  of  N  channels1, 
each  with  bandwidth  Bi  ( i  =  1,  ■  •  •  ,  IV).  These  N  channels  are 
licensed  to  a  primary  network  whose  users  communicate  according 
to  a  synchronous  slot  structure.  The  traffic  statistics  of  the  primary 

1  Here  we  use  the  term  channel  broadly.  A  channel  can  be  a  frequency  band 
with  certain  bandwidth.  It  can  also  be  a  collection  of  spreading  codes  in  a 
CDMA  network  or  a  set  of  tones  in  an  OFDM  system. 
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network  are  such  that  the  occupancy  of  these  N  channels  follows 
a  discrete-time  Markov  process  with  M  =  2N  states2.  Specifically, 
the  network  state  in  slot  t  is  given  by  [Si  ((),•••  ,  Sjv^)]4  where 
Si(t)  £  {0  (occupied)  ,  1  (idle)  }.  We  assume  that  the  spectrum 
usage  statistics  of  the  primary  network  remain  unchanged  for  T  slots. 

We  consider  a  secondary  network  seeking  spectrum  opportunities 
in  these  N  channels.  We  focus  on  an  ad  hoc  network  where 
secondary  users  join/exit  the  network  and  sense/access  the  spectrum 
independently  without  exchanging  local  information.  In  each  slot, 
a  secondary  user  chooses  a  set  of  channels  to  sense  and  a  set  of 
channels  to  access.  Limited  by  its  hardware  constraints  and  energy 
supply,  a  secondary  user  can  sense  no  more  than  Li  (L\  <  N)  and 
access  no  more  than  L2  (L2  <  Lf)  channels  in  each  slot. 

Our  goal  is  to  develop  cognitive  MAC  protocols  for  the  secondary 
network.  For  an  ad  hoc  OSA  network  without  a  central  coordinator 
or  a  dedicated  communication/control  channel,  it  is  desirable  to  have 
a  decentralized  MAC  protocol  where  each  secondary  user  indepen¬ 
dently  searches  for  spectrum  opportunities,  aiming  at  optimizing 
its  own  performance.  Such  decentralized  protocols  do  not  rely  on 
cooperation  among  secondary  users. 

The  Basic  Protocol  Structure  Without  delving  into  protocol  details 
(which  can  be  found  in  [25],  [27]),  we  present  here  the  basic  protocol 
structure.  At  the  beginning  of  each  slot3,  a  secondary  user  with  data 
to  transmit  chooses  a  set  of  channels  to  sense  and  a  set  of  channels 
to  access  based  on  the  sensing  outcome4.  Such  spectrum  sensing  and 
access  decisions  are  made  to  maximize  the  spectrum  efficiency  while 
limiting  the  interference  to  the  primary  network  by  fully  exploiting 
the  sensing  history  and  the  spectrum  occupancy  statistics.  When  the 
secondary  user  decides  to  transmit,  it  generates  a  random  backoff 
time,  and  transmits  when  this  timer  expires  and  no  other  secondary 
user  has  already  accessed  the  channel  during  its  backoff  time.  At 
the  end  of  the  slot,  the  receiver  acknowledges  a  successful  data 
transmission. 

III.  A  Decision-Theoretic  Approach  Based  on  POMDP 

In  this  section,  we  develop  a  decision-theoretic  approach  to  MAC 
design  in  OSA  networks.  We  show  that  the  OSA  network  specified 
in  Section  II  can  be  modelled  by  a  POMDP  and  the  spectrum  sensing 
and  access  component  of  a  MAC  protocol  corresponds  to  a  policy  for 
this  POMDP.  Existing  techniques  and  results  for  POMDP  can  then 
be  used  to  develop  MAC  protocols  for  OSA  networks. 

A.  A  POMDP  Framework 

Consider  the  Markovian  dynamics  of  the  OSA  network  specified  in 
Section  II.  At  the  beginning  of  each  slot,  a  secondary  user  chooses  a 
set  ,4i  (|^4i|  <  Li)  of  channels  to  sense.  Given  that  the  current  state 
of  the  underlying  Markov  process  is  j,  the  user  observes  0j,Ai  € 
{0,  l}!-41!  which  indicates  the  availability  of  each  sensed  channel. 
Based  on  this  observation,  the  user  chooses  a  set  A2  C  Al  (|^2  |  < 
1/2 )  of  channels  to  access.  For  the  chosen  action,  the  user  receives  a 
reward  Rj,Ai,A2  at  the  end  of  this  slot.  The  sequence  of  operations 
in  each  slot  is  illustrated  in  Figure  1. 

The  objective  is  to  choose  the  sensing  and  access  action  {Ai,  A2} 
sequentially  in  each  slot  so  that  the  total  expected  reward  accumulated 

2  Since  the  unavailability  of  a  channel  may  also  be  caused  by  channel  fading, 
the  Markov  chain  model  can  also  include  fading  statistics. 

3The  slot  information  can  be  broadcasted  by  the  primary  network. 

4  In  a  multihop  ad  hoc  networks  where  realizations  of  spectrum  opportuni¬ 
ties  are  location  dependent,  spectrum  sensing  needs  to  be  carried  out  at  both 
the  transmitter  and  the  receiver.  Details  can  be  found  in  [27], 


Fig.  1.  The  sequence  of  operations  in  a  slot. 

over  T  slots  (wherein  the  spectrum  occupancy  statistics  remain 
unchanged)  is  maximized.  We  now  have  a  POMDP  since,  in  general, 
the  network  state  cannot  be  fully  observed  due  to  partial  spectrum 
monitoring  (|.Ai|  <  L 1  <  N). 

For  a  POMDP,  the  internal  state  of  the  underlying  Markov  process 
is  unknown.  At  the  beginning  of  slot  t,  our  knowledge  of  the  internal 
state  of  the  network  based  on  all  past  decisions  and  observations 
can  be  summarized  by  a  belief  vector  A(t)  =  [Ai(t),  •  •  •  ,  A m(()] 
where  A j(t)  is  the  conditional  probability  (given  the  decision  and 
observation  history)  that  the  network  state  is  j  at  the  beginning  of 
slot  t  prior  to  the  state  transition  (see  Figure  1). 

It  has  been  shown  by  Smallwood  and  Sondik  [28]  that  for  any  t, 
the  belief  vector  A(t)  is  a  sufficient  statistic  for  the  design  of  the 
optimal  action  {.Ai,  A2}  in  slot  t.  A  policy  n  for  a  POMDP  is  thus 
given  by  a  sequence  of  functions,  each  mapping  from  the  current 
belief  vector  A(t)  to  the  sensing  and  access  action  {Ai{t) ,  Azit)} 
to  be  taken  in  slot  f,  i.e., 

7T  =  [/ii,  •  •  •  ,/ir],  where  [m  ■  A(t)  £  [0, 1]M  — »  {Ai(t),  -42(t)}. 

(1) 

Under  this  formulation,  a  spectrum  sensing  and  access  strategy  is 
essentially  a  policy  of  this  POMDP  over  a  finite  horizon. 

Note  that  the  dimension  of  the  belief  vector  grows  exponentially 
with  the  number  N  of  channels.  It  has  been  shown  in  [25]  that  when 
channels  evolve  independently,  we  can  find  a  sufficient  statistic  for  the 
optimal  policy  whose  dimension  grows  linearly  with  N.  Specifically, 
let  O  =  [wi ,  •  •  •  ,  Wat]  where  u >i  is  the  probability  (conditioned  on 
the  sensing  and  decision  history)  that  channel  i  is  available  at  the 
beginning  of  a  slot.  Then  Q.  is  a  sufficient  statistic  for  the  optimal 
OSA  protocol  under  N  independent  channels. 

B.  OSA  in  the  Presence  of  Sensing  Error 

In  the  presence  of  sensing  error,  not  only  the  sensing  and  access 
strategy  but  also  the  operating  characteristics  of  the  spectrum  sensor 
affect  the  performance  of  the  OSA  network  and  the  interference 
perceived  by  the  primary  network.  The  problem  is  thus  a  joint  design 
of  the  spectrum  sensor  and  the  OSA  MAC  protocol. 

Spectrum  sensors  perform  a  binary  hypotheses  test:  Ho  (null 
hypothesis  indicating  that  the  sensed  channel  is  available)  vs.  Hi 
(alternative).  If  the  sensor  of  a  secondary  user  mistakes  Ho  for  Hi 
(false  alarm),  the  secondary  user  will  refrain  from  transmitting,  and  a 
spectrum  opportunity  is  overlooked.  On  the  other  hand,  if  the  detector 
mistakes  Hi  for  Ho  (miss  detection),  a  misidentification  of  spectrum 
opportunity  occurs;  the  secondary  user  collides  with  a  primary  user. 
Let  t  and  8  denote,  respectively,  the  overlook  (false  alarm)  and 
misidentification  (miss  detection)  probabilities.  The  performance  of 
the  sensor  is  specified  by  the  Receiver  Operating  Characteristics 
(ROC)  curve  which  gives  8  as  a  function  of  e  (examples  are  given 
in  Figure  2). 

There  are  two  ways  of  formulating  OSA  in  the  presence  of  sensing 
error.  In  the  first  formulation,  the  objective  is  to  design  the  optimal 
spectrum  sensing  and  access  policy  7r*  and  the  operating  point 
<5*  (on  the  ROC  curve)  of  the  spectrum  sensor  to  maximize  the 


expected  total  reward  obtained  by  a  secondary  user  under  a  constraint 
on  the  probability  of  collision.  Specifically,  we  define  the  reward 
Rj,  -Ai  ,a2  (t)  as  the  number  of  bits  delivered  when  the  secondary  user 
senses  channels  in  Ai  and  transmits  using  channels  in  A 2  given  that 
the  network  is  in  state  j.  Assume  that  the  number  of  bits  delivered 
over  a  channel  in  one  slot  is  proportional  to  its  bandwidth.  The  reward 
is  given  by5 

Rj,Ai,Aa(t)  =  Si(t)Bu  (2) 

i£A2 

where  Si(t)  G  {0, 1}  is  the  state  of  channel  i  in  slot  t. 

Let  £  denote  the  maximum  probability  of  collision  allowed  by 
the  primary  network.  The  objective  is  to  design  the  optimal  spectrum 
sensing  and  access  policy  7r*  and  the  operating  point  <5*  (on  the  ROC 
curve)  of  the  spectrum  sensor.  Specifically, 

T 

{tt*,  A*}  =  argmaxE^[y'j?J-(t))^l(t),^2(t)  I  A(l)], 

subject  to  Pc  <  (j,  (3) 

where  represents  the  conditional  expectation  given  that  policy  n 
is  employed,  Pc  is  the  probability  of  collision,  and  A(l)  the  initial 
belief  vector  which  can  be  the  stationary  distribution  of  the  network 
state.  Note  that  the  probability  of  collision  Pc  depends  on  the  sensing 
and  access  policy  n  as  well  as  the  operating  characteristic  of  the 
spectrum  sensor. 

An  alternative  formulation  is  to  impose  penalty  when  collisions 
with  primary  users  occur.  Specifically,  we  define  the  reward  as 

Rj.Ai.Aiit)  =  ^  (1 - (4) 

i£A-2  A2 

where  Ci  >  0  is  the  cost  associated  with  colliding  with  a  primary  user 
in  channel  i.  We  thus  arrive  at  an  unconstrained  formulation  where 
the  objective  is  to  maximize  the  expected  total  reward  accumulated 
in  T  slots: 

T 

=  argmaxE^[Vi?j(t)j^l(t)^2(f)  |  A(l)].  (5) 

1  ’  S  t=  1 

Note  that  the  penalty  Ci  associated  with  collision  determines  the  level 
of  interference  perceived  by  the  primary  network.  In  this  paper,  we 
assume  {Ci}fLi  is  given. 

For  ease  of  presentation,  we  assume  in  the  rest  of  the  paper  that 
L\  =  L2  =  1.  In  this  case,  the  action  taken  in  each  slot  consists  of 
the  index  a  G  { 1 ,  •  •  •  ,N}  of  the  channel  to  be  monitored  and  the 
decision  <b0  G  {0  (no  access)  ,  1  (access)}  on  whether  to  transmit. 
Results  obtained  in  this  paper  can  be  readily  extended  to  general 
cases. 


operating  point  based  on  the  constraint  on  the  probability  of  collision: 
<5*  =  £.  In  this  case,  the  optimal  access  policy  is  given  by 


$ 


a 


1  if  ©a  =  1 

0  if  ea  =  0 


(6) 


Since  the  probability  of  misidentification  of  the  spectrum  sensor  is 
£,  the  probability  of  colliding  with  a  primary  user  under  this  access 
strategy  is  £,  satisfying  the  design  constraint.  The  problem  is  then 
reduced  to  an  unconstrained  POMDP  where  the  optimal  policy  for 
channel  selection  is  obtained  to  maximize  the  throughput  of  the 
secondary  user. 

Both  the  optimal  and  suboptimal  approaches  proposed  in  [25] 
under  the  assumption  of  perfect  sensing  can  be  extended  to  incor¬ 
porate  overlook  and  misidentification  of  spectrum  opportunities.  We 
consider  here  the  suboptimal  greedy  strategy  that  maximizes  the  per- 
slot  throughput.  We  assume  that  channels  evolve  independently  and 
design  the  sensing  and  access  strategy  based  on  the  reduced-state 
sufficient  statistic  fi. 

Assume  that  channel  i  transits  from  state  0  (unavailable)  to  state  1 
(available)  with  probability  at  and  stays  in  state  1  with  probability  ,3, . 
Let  Ua  denote  the  number  of  bits  that  can  be  successfully  delivered 
if  channel  a  is  chosen  in  a  particular  slot.  Given  that  our  knowledge 
of  the  network  state  is  fl  at  the  beginning  of  this  slot  prior  to  the 
state  transition,  the  channel  a*  to  be  selected  for  maximizing  per-slot 
throughput  is  given  by 


a*  =  are  max  E[{70|f2] 

0=1, ■■■  ,N 

=  arg  max  ^  { B„  Pr[Sa  =  1,  0O  =  l|fi]} 

=  arg  max  {Ba{l  -  e)(oja/3a  +  (1  -  uja)aa)}-  (7) 

a= l,---  ,N 

At  the  end  of  this  slot,  the  belief  vector  Q  is  updated  based 
on  the  information  gained  in  this  slot.  The  information  gained  at 
the  transmitter  includes  the  decision  {a*,<I>a,»}  and  the  observation 
{Oa„ ,  Ka, }  where  Ka,  G  {0, 1}  indicates  whether  an  acknowledge¬ 
ment  is  received  at  the  end  of  this  slot6.  The  information  gained  at 
the  receiver,  however,  includes  only  a*  and  Ka ,  since  the  receiver 
does  not  have  the  sensing  outcome  Oa„  at  the  transmitter  (due  to 
sensing  error)  and  cannot  distinguish  an  unsuccessful  transmission 
from  the  no-access  decision  <f>0„  =  0  of  the  transmitter.  In  order  for 
the  transmitter  and  the  receiver  to  arrive  at  the  same  belief  vector 
Q(t  +  1),  which  ensures  that  they  tune  to  the  same  channel  in  the 
next  slot  (see  (7)),  the  belief  vector  should  be  updated  at  both  the 
transmitter  and  the  receiver  using  only  a*  and  Ka,  .  The  belief  vector 
Q(t+  1)  is  thus  given  by 

fl(t  +  1)  =  T(fi(f)|a*,  Ka,)  =  [wi (t  +  1),  •  •  •  ,  t +  1)] 

Ui{t  +  1)  =  Pr[S}(t)  =  l|fi(t),  a„Ka,] 

f  u>i(t)@i  +  (1  —  u>i(t))oti,  if  a,  7^i 


IV.  Spectrum  Sensing  and  Access  under  the 
Constrained  Formulation 

The  problem  formulated  in  (3)  is  a  constrained  POMDP  which 
generally  requires  a  randomized  optimal  policy.  To  obtain  a  determin¬ 
istic  strategy  with  low  complexity,  we  aim  at  separating  the  objective 
function  of  (3)  from  the  constraint.  Specifically,  we  choose  the  sensor 

5  In  this  paper,  we  focus  on  decentralized  cognitive  MAC  where  secondary 
users  make  independent  and  selfish  decisions  without  coordination.  In  this 
case,  a  secondary  user  chooses  its  spectrum  sensing  and  access  strategies 
under  the  assumption  that  it  will  receive  a  reward  when  the  chosen  channel 
is  not  used  by  the  primary  network. 


if  a,  =  i,  Ka ,  =  1 

_ fig*  +(1  — uig*  )gaHt  ) _ 

+  (1— wa*  )aa*  )  +  (^aHt  (1  —  Pa*  )  +  (l  —  ‘■‘•’a*  )(1_ “a,  )) 

if  a*  =  i,  Ka,  =  0 


(8) 


A  detailed  derivation  of  (8)  can  be  found  in  [26]. 

We  can  see  from  (8)  that  the  transmitter  and  the  receiver  maintain 
the  same  belief  vector  independent  of  collision  and  sensing  error. 
Since  the  channel  selection  is  determined  by  the  belief  vector, 
transceiver  synchronization  is  maintained. 


6The  transmission  of  acknowledgement  is  assumed  to  be  error-free. 


The  above  specifies  the  spectrum  sensing  and  access  strategy  when 
the  spectrum  sensor  operates  at  5  =  £  (see  Figure  2).  We  point  out 
that  this  approach  reduces  a  constrained  POMDP  to  an  unconstrained 
one  by  separating  the  design  of  the  spectrum  sensor  from  that  of 
the  sensing  and  access  strategy,  resulting  in  low-complexity  but 
potentially  suboptimal  design.  In  our  future  work,  we  will  pursue 
the  optimal  joint  design  of  the  spectrum  sensor  and  the  OSA  MAC 
and  study  the  tradeoff  between  complexity  and  optimality. 

V.  Spectrum  Sensing  and  Access  under  the 
Unconstrained  Formulation 

In  this  section,  we  consider  the  unconstrained  formulation  given  in 
(5).  The  optimization  in  (5)  can  be  separated  into  two  steps:  obtain  the 
optimal  policy  7r*  ($)  for  a  give  spectrum  sensor  5  and  then  optimize 
over  S.  We  address  first  the  design  of  the  sensing  and  access  strategy 
for  a  given  sensor.  We  focus  on  the  greedy  approach  that  aims  at 
maximizing  the  expected  reward  in  each  slot. 

We  first  specify  if  channel  a  is  chosen,  whether  the  secondary  user 
should  transmit  based  a  given  sensing  outcome.  Let  Ra  denote  the 
reward  obtained  in  this  slot  if  the  user  transmits  in  channel  a.  Clearly, 
the  optimal  access  decision  <1?,!  for  sensing  outcome  0a  =  1  is  given 
by 

E[Ra\ea  =  l,fi]  0,  (9) 

where  E[f?a|0a  =  1,  O]  can  be  obtained  as 

E[fla|ea  =  i,n] 

=  Pr[Sa  =  l|0a  =  1,  n]Ba  -  Pr[Sa  =  O|0a  =  1,  fi]Ca 

_  (Ugpa  +  (1  -  Ulq)aa)(  1  -  e)Ba  -  (ua(  1  -  0a)  +  (1  -  <ua)(l  -  a„))<5C,-, 
(ua0a  +  (1  —  k>a)aa)(l  —  e)  +  (tua(l  -  /3„)  +  (1  —  cua)(  1  —  aa))5 

The  access  decision  for  sensing  outcome  0a  =  0  can  be  similarly 
obtained.  The  optimal  channel  selection  for  maximizing  the  expected 
reward  in  this  slot  is  thus  given  by 

o*  =  argmaxI[E[i?a|eo=iin]>o]E[f?.a|0a  =  1,12]  Pr[0a  =  l|fi] 

+I[E[iJa|ea=O,n]>O]E[f?a|0a  =  0,  n]  Pr[0a  =  0|O], 

where  Hjx]  is  the  indicator  function,  Pr[0a  =  l|f2]  is  given  by  the 
denominator  of  (10),  and  Pr[0a  =  0|fJ]  can  be  similarly  obtained. 
At  the  end  of  this  slot,  the  belief  vector  12  is  updated  based  on  a* 
and  Kat  as  shown  in  (8). 

The  above  specifies  the  spectrum  sensing  and  access  strategy  based 
on  the  greedy  approach  for  a  given  spectrum  sensor  5  under  the 
unconstrained  formulation.  The  optimal  operating  point  5 *  can  be 
obtained  numerically  via  Monte  Carlo  techniques. 

VI.  Numerical  Results 

In  this  section,  we  present  numerical  results  to  evaluate  the 
performance  of  the  proposed  optimal  and  suboptimal  cognitive  MAC 
protocols.  We  focus  on  the  constrained  formulation. 

At  the  beginning  of  each  slot,  the  sensor  takes  L  measurements 
{Yi}i=1  of  the  chosen  channel.  We  assume  that  both  the  channel 
noise  and  the  signal  of  primary  users  can  be  modelled  as  white 
Gaussian  processes.  We  then  have 

{Ho  (when  channel  is  idle)  :  Yt  ~  Af(0,  ctq),  i  =  1,  ■  •  •  ,  L 
Hi  (when  channel  is  busy)  :  Yt  ~  A/”(0,  crj ) ,  i  =  1,  ■  •  •  ,  L 

The  SNR  is  given  by  (cti  —  erg )  /(To .  It  can  be  shown  that  the  Neyman- 
Pearson  detector  for  this  problem  is  given  by  the  energy  detector: 


The  probability  e  of  false  alarm  (opportunity  overlook)  is  determined 
by  the  threshold  r: 

eAFr(||Y||“>r|//0)  =  l-r(^,5U),  (12) 

where  F(L,x)  =  tL~1e~t  dt  is  the  incomplete  gamma  function. 
The  ROC  for  the  Neyman-Pearson  testing  is  thus  given  by 

1  -  «  =  Pr{||Y||2  >  t\Hi}  =  1  -  T  (§,,*)  ,  (13) 

where  r/  satisfies  r  =  1  —  e.  The  ROC  curves  for  different 

SNRs  and  numbers  L  of  samples  are  shown  in  Figure  2. 

In  Figure  3  we  study  the  performance  of  the  proposed  greedy 
approach  (using  the  above  specified  spectrum  sensors)  as  a  function  of 
the  maximum  collision  probability  C,  allowed  by  the  primary  network. 
In  the  upper  plot,  we  focus  on  the  secondary  user.  We  can  see  that 
as  £  increases,  the  throughput  of  the  secondary  user  approaches  to 
the  performance  achieved  by  the  optimal  protocol  in  the  absence 
of  sensing  errors  (the  optimal  protocol  is  given  in  [25]).  This  is 
because  with  a  large  £,  the  probability  e  of  overlook  can  be  very 
small,  leading  to  improved  throughput  for  the  secondary  user  at  a 
price  of  more  collisions  with  the  primary  network.  As  shown  in  the 
lower  plot,  due  to  frequent  collisions,  the  overall  spectrum  efficiency 
considering  both  the  primary  and  secondary  users  decreases  when  £ 
is  large.  The  best  spectrum  efficiency  is  obtained  when  £  is  around 
0.15. 


Fig.  2.  ROC  curves  for  detecting  Gaussian  signal  in  Gaussian  noise. 


VII.  Conclusion 

We  address  in  this  paper  opportunistic  spectrum  access  in  the 
presence  of  sensing  error.  Two  formulations  of  this  problem  are 
proposed.  Under  the  constrained  formulation,  the  objective  is  to 
maximize  the  throughput  of  a  secondary  user  subject  to  a  constraint 
on  the  maximum  probability  of  collision.  For  the  unconstrained 
formulation,  a  penalty  is  imposed  on  each  collision.  Strategies  for 
designing  both  the  spectrum  sensor  and  the  OSA  MAC  protocol  are 
proposed  under  these  two  formulations. 
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